DATE: 03/27/2002
TIME: 13:58:40



<110> APPLICANT: Millennium Pharmaceuticals, Inc.
      Meyers, Rachel
<120> TITLE OF INVENTION: 27419, A NOVEL HUMAN ARGININE-N-METHYL
      TRANSFERASE AND USES THEREOF
<130> FILE REFERENCE: 38155-20039.00
<140> CURRENT APPLICATION NUMBER: US 09/970,638
<141> CURRENT FILING DATE: 2002-03-12
<150> PRIOR APPLICATION NUMBER: US 60/237,717
<151> PRIOR FILING DATE: 2000-10-05
<160> NUMBER OF SEQ ID NOS: 9
<170> SOFTWARE: FastSEQ for Windows Version 4.0

<210> SEQ ID NO: 1
<211> LENGTH: 1201
<212> TYPE: DNA
<213> ORGANISM: Homo sapiens
<220> FEATURE:
<221> NAME/KEY: CDS
<222> LOCATION: (34)...(1161)
<400> SEQUENCE: 1

attgaattta gcggccgcga attcgccctt aag atg tcg cag ccc aag aaa aga    54
                                     Met Ser Gln Pro Lys Lys Arg
                                     1               5

aag ctt gag tcg ggg ggc ggc ggc gaa gga ggg gag gga act gaa gag    102
Lys Leu Glu Ser Gly Gly Gly Gly Glu Gly Gly Glu Gly Thr Glu Glu
        10                  15                  20

gaa gat ggc gcg gag cgg gag gcg gcc ctg gag cga ccc cgg agg act    150
Glu Asp Gly Ala Glu Arg Glu Ala Ala Leu Glu Arg Pro Arg Arg Thr
    25                  30                  35

aag cgg gaa cgg gac cag ctg tac tac gag tgc tac tcg gac gtt tcg    198
Lys Arg Glu Arg Asp Gln Leu Tyr Tyr Glu Cys Tyr Ser Asp Val Ser
40                  45                  50                  55

gtc cac gag gag atg atc gcg gac cgc gtc cgc acc gat gcc tac cgc    246
Val His Glu Glu Met Ile Ala Asp Arg Val Arg Thr Asp Ala Tyr Arg
                60                  65                  70

ctg ggt atc ctt cgg aac tgg gca gca ctg cga ggc aag acg gta ctg    294
Leu Gly Ile Leu Arg Asn Trp Ala Ala Leu Arg Gly Lys Thr Val Leu
            75                  80                  85

gac gtg ggc gcg ggc acc ggc att ctg agc atc ttc tgt gcc cag gcc    342
Asp Val Gly Ala Gly Thr Gly Ile Leu Ser Ile Phe Cys Ala Gln Ala
        90                  95                  100

ggg gcc cgg cgc gtg tac gcg gta gag gcc agc gcc atc tgg caa cag    390
Gly Ala Arg Arg Val Tyr Ala Val Glu Ala Ser Ala Ile Trp Gln Gln
    105                 110                 115



RAW SEQUENCE LISTING 
PATENT APPLICATION: US/09/970,638 

Input Set : F:\38155-20039.txt 

Output Set: N:\CRF3\03272002\l970638.raw 

gcc cgg gag gtg gtg cgg ttc aac ggg ctg gag gac cgg gtg cac gtc    438
Ala Arg Glu Val Val Arg Phe Asn Gly Leu Glu Asp Arg Val His Val
120                 125                 130                 135

ctg ccg gga cca gtg gag act gta gag ttg ccg gaa cag gtg gat gcc    486
Leu Pro Gly Pro Val Glu Thr Val Glu Leu Pro Glu Gln Val Asp Ala
                140                 145                 150

atc gtg agc gag tgg atg ggc tac gga ctc ctg cac gag tcc atg ctg    534
Ile Val Ser Glu Trp Met Gly Tyr Gly Leu Leu His Glu Ser Met Leu
            155                 160                 165

agc tcc gtc ctc cac gcg cga acc aag tgg ctg aag gag ggc ggt ctt    582
Ser Ser Val Leu His Ala Arg Thr Lys Trp Leu Lys Glu Gly Gly Leu
        170                 175                 180

ctc ctg ccg gcc tcc gcc gag ctc ttc ata gcc ccc atc agc gac cag    630
Leu Leu Pro Ala Ser Ala Glu Leu Phe Ile Ala Pro Ile Ser Asp Gln
    185                 190                 195

atg ctg gaa tgg cgc ctg ggc ttc tgg agc cag gtg aag cag cac tat    678
Met Leu Glu Trp Arg Leu Gly Phe Trp Ser Gln Val Lys Gln His Tyr
200                 205                 210                 215

ggt gtg gac atg agc tgc ctg gag ggc ttc gcc acg cgc tgt ctc atg    726
Gly Val Asp Met Ser Cys Leu Glu Gly Phe Ala Thr Arg Cys Leu Met
                220                 225                 230

ggc cac tcg gag atc gtt gtg cag gga ttg tcc ggc gag gac gtg ctg    774
Gly His Ser Glu Ile Val Val Gln Gly Leu Ser Gly Glu Asp Val Leu
            235                 240                 245

gcc cgg ccg cag cgc ttt gct cag cta gag ctc tcc cgc gcc ggc ttg    822
Ala Arg Pro Gln Arg Phe Ala Gln Leu Glu Leu Ser Arg Ala Gly Leu
        250                 255                 260

gag cag gag ctg gag gcc gga gtg ggc ggg cgc ttc cgc tgc agc tgc    870
Glu Gln Glu Leu Glu Ala Gly Val Gly Gly Arg Phe Arg Cys Ser Cys
    265                 270                 275

tat ggc tcg gcg ccc atg cat ggc ttt gcc atc tgg ttc cag gtg acc    918
Tyr Gly Ser Ala Pro Met His Gly Phe Ala Ile Trp Phe Gln Val Thr
280                 285                 290                 295

ttc cct gga ggg gag tcg gag aaa ccc ctg gtg ctg tcc acc tcg cct    966
Phe Pro Gly Gly Glu Ser Glu Lys Pro Leu Val Leu Ser Thr Ser Pro
                300                 305                 310

ttt cac ccg gcc act cac tgg aaa cag gcg ctc ctc tac ctg aac gag   1014
Phe His Pro Ala Thr His Trp Lys Gln Ala Leu Leu Tyr Leu Asn Glu
            315                 320                 325

ccg gtg caa gtg gag caa gac acg gac gtt tca gga gag atc acg ctg   1062
Pro Val Gln Val Glu Gln Asp Thr Asp Val Ser Gly Glu Ile Thr Leu
        330                 335                 340

ctg ccc tcc cgg cac aac ccc cgt cgc ctg cgc gtg ctg ctg cgc tac   1110
Leu Pro Ser Arg His Asn Pro Arg Arg Leu Arg Val Leu Leu Arg Tyr
    345                 350                 355

aaa gtg gga gac cag gag gag aag acc aaa gac ttt gcc atg gag gac   1158
Lys Val Gly Asp Gln Glu Glu Lys Thr Lys Asp Phe Ala Met Glu Asp
360                 365                 370                 375

tga gcgttgccaa gggcgaattc gtttaaacct gcaggactag                   1201
 *

<210> SEQ ID NO: 2
<211> LENGTH: 375
<212> TYPE: PRT
<213> ORGANISM: Homo sapiens
<400> SEQUENCE: 2

Met Ser Gln Pro Lys Lys Arg Lys Leu Glu Ser Gly Gly Gly Gly Glu
1               5                   10                  15

Gly Gly Glu Gly Thr Glu Glu Glu Asp Gly Ala Glu Arg Glu Ala Ala
            20                  25                  30

Leu Glu Arg Pro Arg Arg Thr Lys Arg Glu Arg Asp Gln Leu Tyr Tyr
        35                  40                  45

Glu Cys Tyr Ser Asp Val Ser Val His Glu Glu Met Ile Ala Asp Arg
    50                  55                  60

Val Arg Thr Asp Ala Tyr Arg Leu Gly Ile Leu Arg Asn Trp Ala Ala
65                  70                  75                  80

Leu Arg Gly Lys Thr Val Leu Asp Val Gly Ala Gly Thr Gly Ile Leu
                85                  90                  95

Ser Ile Phe Cys Ala Gln Ala Gly Ala Arg Arg Val Tyr Ala Val Glu
            100                 105                 110

Ala Ser Ala Ile Trp Gln Gln Ala Arg Glu Val Val Arg Phe Asn Gly
        115                 120                 125

Leu Glu Asp Arg Val His Val Leu Pro Gly Pro Val Glu Thr Val Glu
    130                 135                 140

Leu Pro Glu Gln Val Asp Ala Ile Val Ser Glu Trp Met Gly Tyr Gly
145                 150                 155                 160

Leu Leu His Glu Ser Met Leu Ser Ser Val Leu His Ala Arg Thr Lys
                165                 170                 175

Trp Leu Lys Glu Gly Gly Leu Leu Leu Pro Ala Ser Ala Glu Leu Phe
            180                 185                 190

Ile Ala Pro Ile Ser Asp Gln Met Leu Glu Trp Arg Leu Gly Phe Trp
        195                 200                 205

Ser Gln Val Lys Gln His Tyr Gly Val Asp Met Ser Cys Leu Glu Gly
    210                 215                 220

Phe Ala Thr Arg Cys Leu Met Gly His Ser Glu Ile Val Val Gln Gly
225                 230                 235                 240

Leu Ser Gly Glu Asp Val Leu Ala Arg Pro Gln Arg Phe Ala Gln Leu
                245                 250                 255

Glu Leu Ser Arg Ala Gly Leu Glu Gln Glu Leu Glu Ala Gly Val Gly
            260                 265                 270

Gly Arg Phe Arg Cys Ser Cys Tyr Gly Ser Ala Pro Met His Gly Phe
        275                 280                 285

Ala Ile Trp Phe Gln Val Thr Phe Pro Gly Gly Glu Ser Glu Lys Pro
    290                 295                 300

Leu Val Leu Ser Thr Ser Pro Phe His Pro Ala Thr His Trp Lys Gln
305                 310                 315                 320

Ala Leu Leu Tyr Leu Asn Glu Pro Val Gln Val Glu Gln Asp Thr Asp
                325                 330                 335

Val Ser Gly Glu Ile Thr Leu Leu Pro Ser Arg His Asn Pro Arg Arg
            340                 345                 350

Leu Arg Val Leu Leu Arg Tyr Lys Val Gly Asp Gln Glu Glu Lys Thr
        355                 360                 365

Lys Asp Phe Ala Met Glu Asp
    370                 375

<210> SEQ ID NO: 3
<211> LENGTH: 1128
<212> TYPE: DNA
<213> ORGANISM: Homo sapiens
<400> SEQUENCE: 3

atgtcgcagc ccaagaaaag aaagcttgag tcggggggcg gcggcgaagg aggggaggga      60
actgaagagg aagatggcgc ggagcgggag gcggccctgg agcgaccccg gaggactaag     120
cgggaacggg accagctgta ctacgagtgc tactcggacg tttcggtcca cgaggagatg     180
atcgcggacc gcgtccgcac cgatgcctac cgcctgggta tccttcggaa ctgggcagca     240
ctgcgaggca agacggtact ggacgtgggc gcgggcaccg gcattctgag catcttctgt     300
gcccaggccg gggcccggcg cgtgtacgcg gtagaggcca gcgccatctg gcaacaggcc     360
cgggaggtgg tgcggttcaa cgggctggag gaccgggtgc acgtcctgcc gggaccagtg     420
gagactgtag agttgccgga acaggtggat gccatcgtga gcgagtggat gggctacgga     480
ctcctgcacg agtccatgct gagctccgtc ctccacgcgc gaaccaagtg gctgaaggag     540
ggcggtcttc tcctgccggc ctccgccgag ctcttcatag cccccatcag cgaccagatg     600
ctggaatggc gcctgggctt ctggagccag gtgaagcagc actatggtgt ggacatgagc     660
tgcctggagg gcttcgccac gcgctgtctc atgggccact cggagatcgt tgtgcaggga     720
ttgtccggcg aggacgtgct ggcccggccg cagcgctttg ctcagctaga gctctcccgc     780
gccggcttgg agcaggagct ggaggccgga gtgggcgggc gcttccgctg cagctgctat     840
ggctcggcgc ccatgcatgg ctttgccatc tggttccagg tgaccttccc tggaggggag     900
tcggagaaac ccctggtgct gtccacctcg ccttttcacc cggccactca ctggaaacag     960
gcgctcctct acctgaacga gccggtgcaa gtggagcaag acacggacgt ttcaggagag    1020
atcacgctgc tgccctcccg gcacaacccc cgtcgcctgc gcgtgctgct gcgctacaaa    1080
gtgggagacc aggaggagaa gaccaaagac tttgccatgg aggactga                 1128

<210> SEQ ID NO: 4
<211> LENGTH: 304
<212> TYPE: PRT
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: Consensus amino acid sequence
<400> SEQUENCE: 4

Ile Pro Thr Ser Tyr Asp Ile Ile Gly Asp Ile Val Ile Leu Asn Ile
1               5                   10                  15

Pro Asp Glu Leu Arg Pro Tyr Arg Lys Val Ile Gly Glu Ala Ile Leu
            20                  25                  30

Lys Lys Thr Lys Lys Asn Glu Met Ala Val Val Lys Thr Val Leu Arg
        35                  40                  45

Lys Gly Gly Glu Glu Asp Phe Glu Gly Leu Leu Gly Glu Gly Tyr Gly
    50                  55                  60

Ile Thr Gly Glu Tyr Arg Thr Arg Arg Glu Leu Leu Ala Gly Glu Lys
65                  70                  75                  80

Asp Ser Gly Glu Thr Val Thr Ile His Lys Glu Asn Gly Cys Arg Phe
                85                  90                  95

Lys Leu Asp Val Ser Lys Val Tyr Phe Ser Pro Arg Leu Glu Gly Glu
            100                 105                 110
Arg Glu Arg Leu Ala Lys Leu Val Lys ??? ??? ??? Val Val Val ???
        115                 120                 125

Met Phe Ala Gly Ile Gly Pro Phe Ser Ile Pro Ile Ala Lys ??? ???
    130                 135                 140

Lys Ala Lys Arg Val Tyr Ala Val Asp Leu Asn Pro ??? Ala Val Lys
145                 150                 155                 160

Tyr Leu Lys Glu Asn ??? Lys Leu Asn Lys Val ??? Asp ??? Pro Lys
                165                 170                 175

Ile Gln Thr Val Val Asn Lys Ile Ile Pro Ile Leu ??? Asp Val Arg
            180                 185                 190

Glu Val Ile Pro Glu Lys Glu Ala ??? Val ??? Ser Leu Val Lys ???
        195                 200                 205

Val Val Ala Asp Arg Val Ile ??? Pro Leu ??? ??? Ala ??? Val ???
    210                 215                 220

Gln Ile Ser Ala His ??? ??? Leu Asp ??? Ala ??? ??? Lys Val Lys
225                 230                 235                 240

Asp Gly Gly Ile Leu Val Ser Val Ile ??? Tyr ??? ??? ??? ??? Pro
                245                 250                 255

Glu Lys Ala Ile Pro Phe Leu Pro Val Glu Arg Ile Lys Arg Ala Ala
            260                 265                 270

Glu Lys Glu Gly Val Lys Val Glu Ile Leu Lys Lys Arg Lys Val Lys
        275                 280                 285

Asn Tyr Ala Pro Gly Val Tyr His Val Val Leu Asp Ala Arg Val ???
    290                 295                 300

<210> SEQ ID NO: 5
<211> LENGTH: 185
<212> TYPE: PRT
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: Consensus amino acid sequence
<400> SEQUENCE: 5

Pro Asp Lys Cys Thr Leu His Leu Thr Ala Ile Glu Asp Thr ??? Tyr
1               5                   10                  15

Lys Asp Glu Lys Ile Asn Phe Trp Asp Asp Val Tyr Gly Phe Asx ???
            20                  25                  30

Ser Cys Met Lys Asp Ala Ala Ile Lys Glu Pro Ile Val Asp Pro Lys
        35                  40                  45

Tyr Asn His Val Val Asp Pro Asn Gln Ile Ile Thr Asp Pro Cys Thr
    50                  55                  60

Ile Lys Glu Phe Asp Met His Thr Val Lys Ile Ser Asp Leu Glu Thr
65                  70                  75                  80

Phe Thr Ser Pro Phe Lys Leu Thr Val Lys Arg Asn Gly Met Leu His
                85                  90                  95

Ala Phe Thr Ala Trp Phe Asp Val Tyr Phe Thr Ser Lys Arg Cys His
            100                 105                 110

Pro Ile Pro Glu Ala Ile Asp Lys Pro Val Gly Phe Ser Thr Gly Pro
        115                 120                 125

Gln Ser Pro Tyr Thr His Trp Lys Gln Thr Ile Phe Tyr Met Glu Asp